Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

Abstract

In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robotic platforms. Specifically, we extend Entropy Search, a Bayesian optimization algorithm that maximizes information gain from each experiment, to the case of multiple information sources. The result is a principled way to automatically combine cheap, but inaccurate information from simulations with expensive and accurate physical experiments in a cost-effective manner. We apply the resulting method to a cart-pole system, which confirms that the algorithm can find good control policies with fewer experiments than standard Bayesian optimization on the physical system only.
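The abstract's core idea, trading cheap simulator queries against costly hardware trials inside one Bayesian optimizer, can be illustrated with a short sketch. The snippet below is a minimal illustration under stated assumptions, not the authors' algorithm: it treats the information source as an extra Gaussian-process input dimension, assumes a known fixed cost per source, and uses plain expected improvement as a stand-in for the paper's Entropy Search criterion. All names, costs, and kernel settings are illustrative assumptions.

    # Minimal multi-fidelity Bayesian optimization sketch (illustrative only).
    # A GP is fit over (policy parameter, source) pairs; the next query is the
    # candidate with the best acquisition value per unit experiment cost.
    # Expected improvement stands in for the paper's Entropy Search criterion.
    import numpy as np
    from scipy.stats import norm
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    COST = {0.0: 1.0, 1.0: 20.0}  # source 0 = simulation, 1 = hardware (made-up costs)

    def expected_improvement(mu, sigma, best):
        # EI for minimization of the control cost.
        z = (best - mu) / np.maximum(sigma, 1e-9)
        return (best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

    def next_query(X, y, candidates):
        """X: past queries as rows of (theta, source); y: observed costs;
        candidates: array of (theta, source) points to choose from."""
        gp = GaussianProcessRegressor(kernel=RBF(length_scale=[0.3, 1.0]),
                                      normalize_y=True)
        gp.fit(X, y)
        mu, sigma = gp.predict(candidates, return_std=True)
        score = expected_improvement(mu, sigma, y.min())
        score /= np.array([COST[s] for s in candidates[:, -1]])
        return candidates[np.argmax(score)]

    # Example: one simulated and one hardware observation of a 1-D policy gain.
    X = np.array([[0.2, 0.0], [0.8, 1.0]])
    y = np.array([1.5, 0.9])
    grid = np.array([[t, s] for t in np.linspace(0, 1, 21) for s in (0.0, 1.0)])
    print(next_query(X, y, grid))

Dividing the acquisition value by the per-source cost is what steers the optimizer toward cheap simulator queries first, reserving hardware trials for when only they remain informative.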
